215 research outputs found

Hawkeye: An interactive visual analytics tool for genome assemblies

Author: Phillippy A. M.
Salzberg S. L.
Schatz M. C.
Shneiderman B.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

Genome sequencing remains an inexact science, and genome sequences can contain significant errors if they are not carefully examined. Hawkeye is our new visual analytics tool for genome assemblies, designed to aid in identifying and correcting assembly errors. Users can analyze all levels of an assembly along with summary statistics and assembly metrics, and are guided by a ranking component towards likely mis-assemblies. Hawkeye is freely available and released as part of the open source AMOS project http://amos.sourceforge.net/hawkeye. © 2007 Schatz et al.; licensee BioMed Central Ltd

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Springer - Publisher Connector

PubMed Central

Digital Repository at the University of Maryland

The evolution of the natural killer complex; a comparison between mammals using new high-quality genome assemblies and targeted annotation.

Author: Bickhart Derek M
Gibson Mark S
Hammond John A
Heimeier Dorothea
Koren Sergey
Medrano Juan F
Phillippy Adam M
Schwartz John C
Smith Timothy PL
Publication venue: eScholarship, University of California
Publication date: 01/01/2017

Natural killer (NK) cells are a diverse population of lymphocytes with a range of biological roles including essential immune functions. NK cell diversity is in part created by the differential expression of cell surface receptors which modulate activation and function, including multiple subfamilies of C-type lectin receptors encoded within the NK complex (NKC). Little is known about the gene content of the NKC beyond rodent and primate lineages, other than it appears to be extremely variable between mammalian groups. We compared the NKC structure between mammalian species using new high-quality draft genome assemblies for cattle and goat; re-annotated sheep, pig, and horse genome assemblies; and the published human, rat, and mouse lemur NKC. The major NKC genes are largely in the equivalent positions in all eight species, with significant independent expansions and deletions between species, allowing us to propose a model for NKC evolution during mammalian radiation. The ruminant species, cattle and goats, have independently evolved a second KLRC locus flanked by KLRA and KLRJ, and a novel KLRH-like gene has acquired an activating tail. This novel gene has duplicated several times within cattle, while other activating receptor genes have been selectively disrupted. Targeted genome enrichment in cattle identified varying levels of allelic polymorphism between the NKC genes concentrated in the predicted extracellular ligand-binding domains. This novel recombination and allelic polymorphism is consistent with NKC evolution under balancing selection, suggesting that this diversity influences individual immune responses and may impact on differential outcomes of pathogen infection and vaccination.

Springer - Publisher Connector

PubMed Central

Repositório da Universidade Nova de Lisboa

eScholarship - University of California

Mauve Assembly Metrics

Author: Aaron E. Darling
Altschul
Andrew Tritt
Bergeron
Darling
Hartman
Jonathan A. Eisen
Kurtz
Marc T. Facciotti
Phillippy
Rissman
Publication venue: Oxford University Press
Publication date: 01/01/2011

Summary: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics

CiteSeerX

Crossref

OPUS - University of Technology Sydney

PubMed Central

eScholarship - University of California

Re-Assembly of the Genome of Francisella tularensis Subsp. holarctica OSU18

Author: A Johansson
AM Phillippy
D Gordon
Daniela Puiu
DT Dennis
EW Myers
JF Petrosino
JR White
L Rohmer
M Enserink
M Pop
Matthew W. Hahn
MC Schatz
P Havlak
S Kurtz
SL Salzberg
Steven L. Salzberg
Publication venue: Public Library of Science
Publication date: 17/10/2008

Francisella tularensis is a highly infectious human intracellular pathogen that is the causative agent of tularemia. It occurs in several major subtypes, including the live vaccine strain holarctica (type B). F. tularensis is classified as a category A biodefense agent in part because a relatively small number of organisms can cause severe illness. Three complete genomes of subspecies holarctica have been sequenced and deposited in public archives, of which OSU18 was the first and the only strain for which a scientific publication has appeared [1]. We re-assembled the OSU18 strain using both de novo and comparative assembly techniques, and found that the published sequence has two large inversion mis-assemblies. We generated a corrected assembly of the entire genome along with detailed information on the placement of individual reads within the assembly. This assembly will provide a more accurate basis for future comparative studies of this pathogen.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Inositol 1,3,4,5,6-pentakisphosphate 2-kinase is a distant IPK member with a singular inositide binding site for axial 2-OH recognition

Author: Agarwal
B. Gonzalez
BOZSIK
C. A. Brearley
Caddick
Cheek
Collaborative Computational Project Number
Cowtan
González
Graf
Hanakahi
Irvine
J. I. Banos-Sanz
J. Sanz-Aparicio
M. Villate
Maffucci
Messenguy
Michell
Miller
Mulugu
Murphy
Murshudov
Nolen
Phillippy
Raboy
Rowan
Sarmah
Shamsuddin
Sheldrick
Shukla
Sweetman
Vonrhein
York
Publication venue: Proceedings of the National Academy of Sciences
Publication date: 07/05/2010

Inositol phosphates (InsPs) are signaling molecules with multiple roles in cells. In particular, inositol hexakisphosphate (InsP6) is involved in mRNA export and editing or chromatin remodeling, among other events. InsP6 accumulates as mixed salts (phytate) in storage tissues of plants and plays a key role in their physiology. Human diets that are exclusively grain-based provide an excess of InsP6 that, through chelation of metal ions, may have a detrimental effect on human health. Ins(1,3,4,5,6)P5 2-kinase (InsP5 2-kinase or Ipk1) catalyses the synthesis of InsP6 from InsP5 and ATP, and is the only enzyme that transfers a phosphate group to the axial 2-OH of the myo-inositide. We present the first structure for an InsP5 2-kinase in complex with both substrates and products. This enzyme presents a singular structural region for inositide binding that encompasses almost half of the protein. The key residues in substrate binding are identified, with Asp368 being responsible for recognition of the axial 2-OH. This study sheds light on the unique molecular mechanism for the synthesis of the precursor of inositol pyrophosphates.

Crossref

PubMed Central

Digital.CSIC

University of East Anglia digital repository

Statistical Analysis of Microarray Data with Replicated Spots: A Case Study with Synechococcus WH8102

Author: Brahamsha B.
Elbourne L. D. H.
Haaland D. M.
Palenik B.
Paulsen I. T.
Phillippy K. H.
Thomas E. V.
Timlin J. A.
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2009

Until recently, microarray experiments often involved relatively few arrays with only a single representation of each gene on each array. A complete genome microarray with multiple spots per gene (spread out spatially across the array) was developed in order to compare the gene expression of a marine cyanobacterium and a knockout mutant strain in a defined artificial seawater medium. Statistical methods were developed for analysis in the special situation of this case study where there is gene replication within an array and where relatively few arrays are used, which can be the case with current array technology. Due in part to the replication within an array, it was possible to detect very small changes in the levels of expression between the wild type and mutant strains. One interesting biological outcome of this experiment is the indication of the extent to which the phosphorus regulatory system of this cyanobacterium affects the expression of multiple genes beyond those strictly involved in phosphorus acquisition.

Crossref

Directory of Open Access Journals

PubMed Central

Macquarie University ResearchOnline

NCBI GEO: archive for high-throughput functional genomic data

Author: A. Soboleva
Ball
Barrett
Brazma
C. Evangelista
Chen
D. B. Troup
D. Rudnev
Edgar
I. F. Kim
K. A. Marshall
K. H. Phillippy
M. Tomashevsky
Meissner
P. Ledoux
P. M. Sherman
R. Edgar
R. N. Muertter
S. E. Wilhite
T. Barrett
Publication venue: Oxford University Press
Publication date: 01/01/2009

The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as ‘Minimum Information About a Microarray Experiment’ (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/

CiteSeerX

Crossref

PubMed Central

Interactive metagenomic visualization in a Web browser

Author: A Brady
A Dix
Adam M Phillippy
B Johnson
B Shneiderman
Brian D Ondov
DH Huson
EW Sayers
F Meyer
G Draper
GW Tyson
J Goecks
J Goll
J Qin
J Stasko
J Yang
JC Wooley
K Andrews
Nicholas H Bergman
Q Wang
S Mitra
SF Altschul
VD Pham
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: A critical output of metagenomic studies is the estimation of abundances of taxonomical or functional groups. The inherent uncertainty in assignments to these groups makes it important to consider both their hierarchical contexts and their prediction confidence. The current tools for visualizing metagenomic data, however, omit or distort quantitative hierarchical relationships and lack the facility for displaying secondary variables. Results: Here we present Krona, a new visualization tool that allows intuitive exploration of relative abundances and confidences within the complex hierarchies of metagenomic classifications. Krona combines a variant of radial, space-filling displays with parametric coloring and interactive polar-coordinate zooming. The HTML5 and JavaScript implementation enables fully interactive charts that can be explored with any modern Web browser, without the need for installed software or plug-ins. This Web-based architecture also allows each chart to be an independent document, making them easy to share via e-mail or post to a standard Web server. To illustrate Krona's utility, we describe its application to various metagenomic data sets and its compatibility with popular metagenomic analysis tools. Conclusions: Krona is both a powerful metagenomic visualization tool and a demonstration of the potential of HTML5 for highly accessible bioinformatic visualizations. Its rich and interactive displays facilitate more informed interpretations of metagenomic analyses, while its implementation as a browser-based application makes it extremely portable and easily adopted into existing analysis packages. Both the Krona rendering code and conversion tools are freely available under a BSD open-source license, and available from: http://krona.sourceforge.net.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central